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EXPRESSION OF FOREIGN GENES USING A REPLICATING PCLYPROIEEN FRODUCING VMJS VECTOR 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates to polyprotein-producing viral vectors for 
production of non-native proteins and which employ proteolytic digestion of the non-native 
5 proteins by the viral vector's own proteases. 

2. Description of Related Art 

Recombinant DNA techniques and genetic technologies have led to progress 
in the development of gene transfer into organisms, both plant and animal. Investigators 
have attempted to produce pharmaceuticals, chemicals, and biologicals by gene transfer 

0 techniques. Even though a gene may have been identified, cloned, and engineered, it is still 

necessary to introduce the gene into a host cell in which the gene may be expressed. 

Foreign genes may be expressed in the host from DNA integrated into the host 
genome (transgenic organism), or from extrachromosomal pieces of DNA or RNA which are 
not inherited (through transient expression vectors). A variety of methods have been 

5 developed and are currently available for the transformation of various cells with genetic 

material. The most success has been achieved with dicotyledonous plants with some limited 
success reported for the monocotyledonous plants. The most highly developed system for 
higher plants transformation is derived from the tumor-inducing mechanism of the soil 
bacterium Agrobacterium tumefaciens. The AgrobacteriumlTi plasmid system exploits the 

» elements of the Agrobacterium transformation mechanism. Alternative methods for 

delivering genetic material to plants include the direct incorporation of DNA such as the 
transformation of plant cell protoplast. Other techniques include microinjection or DNA 
bombardment. 

Transient expression vectors can be introduced directly into the recipient cell, 
and result in gene expression in a short period of time, generally 24-48 hours. Viral vectors 
(both DNA and RNA) offer possibilities as plant and animal transformation vectors. Viral 
vectors have some advantages over vectors involving non-viral DNA in certain circumstances. 
Advantages of viral vectors include improvements in the ease of gene introduction. 

Viruses that have been shown to be useful for the transformation of plant hosts 
include caulimovirus (CaMV), tobacco mosaic virus (TMV) and brome mosaic virus (BMV). 

- 1 - 
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Transformation of plants using plant viruses is described in US 4.855.237 (bean golden 
mosaic virus), EP-A 67,553 (TMV), Japanese Published Application No. 63-14693 (TMV). 
EPA 194.809 (BMV), EPA 278,667 (BV), Brisson. N. et al.. Met. Enzymol. 1 18:659 (1986) 
(CaMV), Guzman et al. Commun. Molec. Biol.: Viral Vectors. Cold Spring Harbor 
Laboratory, New York, pp. 172-189 (1988), and WO 93/03161 (TMV). Pseudovirus 
particles for use in expressing foreign DNA in many hosts, including plants, is described in 
WO 87/06261. 

For RNA viruses, a primary consideration is designing a vector that produces 
infectious transcripts, and therefore preserves the terminal structure of the RNA in order for 
the transcript to be a good substrate for the viral replicase. Bacteriophage promoters have 
proven ideal for directing the synthesis of RNA from DNA clones. One RNA virus 
expression vector, pPMl, utilized a modified lambda phage P f promoter. This vector was 
used to synthesize infectious transcripts of brome mosaic virus (BMV) genomic RNA from 
BMV cDNA clones. However, most of these viral vectors have not been capable of systemic 
spread in the plant and expression of the non- viral foreign genes in the majority of the plant 
cells in the whole plant. Another disadvantage of many of the prior art viral vectors is that 
they are not stable for the maintenance of non-viral foreign genes. The use of RNA vectors 
has also been limited by the instability of inserted foreign sequences and disruption of 
systemic virus spread after replacement of virus genes involved in movement. TMV viral 
vectors require a subgenomic promoter for operation. The prior art viral vectors do not 
provide a means for using viral proteases to cleave the foreign proteins made by the 
transformed plants. Consequently, there is a need for vectors which enable enhanced 
systemic expression of a nucleic acid sequence after a short inoculation time and for vectors 
which can be used to produce both native and non-native proteins relative to the host cell, 
particularly plant cells, while allowing cleavage of the produced protein by viral proteases. 
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SUMMARY OF THE INVENTION 

It is an object of the invention to provide a polyprotein-producing virus 
expression vector adapted to encoding at least one sequence for at least one heterologous 
protein. The vector may be used to express heterologous proteins in plants or plant cells. 

Accordingly, the present invention provides a viral expression vector encoding 
for at least one protein non-native to the vector that is released from at least one polyprotein 
expressed by said vector by proteolytic processing catalyzed by at least one protease in said 
polyprotein, said vector comprising: (a) at least one promoter; (b) cDNA, wherein said cDNA 
comprises a cDNA sequence which codes for at least one polyprotein from a polyprotein- 
producing virus; (c) at least one unique restriction site flanking a 3' terminus of said cDNA; 
and (d) a cloning vehicle. 

The present invention also provides a viral expression vector encoding for at 
least one protein non-native to the vector that is released from at least one polyprotein 
expressed by said vector by proteolytic processing catalyzed by at least one protease in said 
polyprotein, said vector comprising: (a) at least one promoter; (b) cDNA, wherein said cDNA 
comprises a cDNA sequence which codes for at least one polyprotein from a polyprotein- 
producing virus, and wherein said polyprotein comprises at least one protein non-native to 
the vector; (c) at least one unique restriction site flanking a 3' terminus of said cDNA; and 
(d) a cloning vehicle. 

The present invention also provides a method for producing an expression 
vector encoding for at least one selected protein non-native to the vector that is released from 
at least one polyprotein expressed by said vector by proteolytic processing catalyzed by 
proteases in said polyprotein, said method comprising: (a) reverse transcribing a polyprotein- 
producing RNA into cDNA; (b) introducing at least one unique restriction site flanking a 3' 
terminus of said cDNA; and (c) inserting said cDNA into a cloning vehicle. 

The present invention also provides a method for producing an expression 
vector, adapted for expressing in a host cell at least one selected protein non-native to the 
vector that can be released from at least one polyprotein by proteolytic processing catalyzed 
by proteases in said polyprotein, said method comprising: (a) reverse transcribing a 
polyprotein-producing RNA into a first cDNA; (b) introducing at least one unique restriction 
site flanking a 3' terminus of said first cDNA; and (c) inserting into said first cDNA a second 
cDNA sequence wherein said second cDNA sequence encodes a protein non-native to the 
vector; and (d) inserting said first and second cDNA into a cloning vehicle. 

-> 
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The present invention also provides a method for expressing at least one 
protein in a host cell, wherein the protein is non-native to the host cell, said method 
comprising infecting a host cell susceptible to a polyprotein-producing virus with an 
expression vector, wherein said vector comprises a cDNA sequence encoding at least one 
5 polyprotein, each of said polyprotein comprising at least one protein non-native to the vector 

and at least one protease, and wherein said protein non-native to the vector is released from 
said polyprotein by proteolytic processing catalyzed by said protease encoded by said cDNA, 
and expressing said protein non-native to the vector in said host cell. 

These and other advantages of the present invention will become apparent from 
10 the following detailed description. 
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BRIEF DESCRIP TION OF THE DRAWINGS 
Figure 1 depicts diagrams of portions of plasmids containing cDNA 
representing the complete TEV genome and inserted GUS gene. 

Figure 2 depicts iirurronoblot analysis of extracts from RNA transcript- 
inoculated tobacco plants. 

Figure 3 depicts immunoblot analysis of extracts from tobacco plants infected 
by serially passaged TEV-GUS. 

Figure 4(A) depicts analysis of GUS-HC-Pro fusion protein and GUS activity 
in aging plants infected by TEV-GUS. Figure 4(B) depicts GUS activities as determined by 
fluorometric assay with the extracts analyzed in Figure 4(A). 

Figure 5 depicts genetic maps of TEV and selected TEV variants used in this 
study. Figure 5(A) shows wild-type TEV-HAT as derived from transcription of pTEV7DA. 
Figure 5(B) depicts TEV-GUS and deletion variants. 

Figure 6 depicts immunoblot analysis of extracts from upper leaves of TEV- 
GUS-inoculated tobacco plants using anti-HC-Pro sera. 

Figure 7 depicts relative positions of deletion endpoints within the GUS and 
HC-Pro coding sequences in spontaneous mutants of TEV-GUS. 

Figure 8 depicts immunoblot analysis with anti-HC-Pro serum of extracts from 
plants inoculated with synthetic RNA transcripts. Figure 8(A) depicts extracts from TEV- 
HAT r and TEV-7del r Figure 8(B) depicts extracts from a mock-inoculated plant, wild-type 
TEV-HAT, TEV-2del, synthetic RNA transcripts from pTEV7DA-2del r pTEV7DA-AGUS, 
and pTEV7DA-AHC. 

Figure 9 depicts quantitation of TEV capsid protein (A, B) and RNA (C, D) 
present in systemically infected leaves of plants inoculated with wild-type and mutant viruses. 
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DETAILED DESCRIPTION OF THE INVENTION 
The present invention relates generally to viral vector expression systems for 
inserting selected genes into host cells. The present invention facilitates the production of 
selected proteins using a polyprotein-producing virus vector. As used herein, the term 

5 "polyprotein-producing virus" means a virus from the picorna-like supergroup as. for 

example, those described in Doija, V.V. and Carrington, J.C., Seminars in Virology 3: 315- 
326 (1992), incorporated herein by reference, which produces a polyprotein. The picorna-like 
supergroup of viruses encompass both plant and animal viruses. Picorna-like plant viruses 
are the most preferred. "Polyprotein" means a protein encoded by a virus from the picorna- 

1 o like supergroup that can be post-translationally cleaved by one or more proteases. 

Polyprotein-producing plant viruses include, but are not limited to, potyviruses, 
nepoviruses. and comoviruses. Potyviruses useful in the present invention include, but are 
not limited to, tobacco etch potyvirus (TEV), and the viruses referenced in Edwardson, J.R. 
and Christie, R.G., "The Potyvirus Group: Monograph No. 16 (Agric. Exp. Station, Univ. of 

15 Florida, 1991), incorporated by reference herein. Animal viruses from the picorna-like 

supergroup that produce polyproteins include, but are not limited to picorna viruses and polio 
viruses. 

As used herein the term "vector" refers to a vector produced with a 
polyprotein-producing plant virus cDNA and a cloning vehicle. In a preferred embodiment 

20 the vector includes a coding sequence non-native to the viral vector inserted between coding 

sequences of the viral genome. Vectors useful in the present invention generate a non-native 
protein relative to the virus in the host as part of a polyprotein. Native viral proteases 
present in the vector cleave the polyprotein into individual proteins. "Protein non-native to 
the viral vector" or "non-native protein" as used herein means a protein(s) or polypeptide(s) 

25 that is not expressed by the wild-type virus. A promoter is a DNA sequence that binds RNA 

polymerase and facilitates initiation of transcription. 

In a preferred embodiment, the polyprotein-producing virus is TEV. TEV is 
a well-characterized potyvirus, and contains a positive-strand RNA genome of 9.5 kilobases 
(kb) coding for a single, large polyprotein that is processed by three virus-specific 

30 proteinases. The nuclear inclusion protein "a" proteinase is involved in the maturation of 

several replication-associated proteins and capsid protein. The helper component-proteinase 
(HC-Pro) and 35-kDa proteinase both catalyze cleavage only at their respective C- termini. 
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The proteolytic domain in each of these proteins is located near the C- terminus. The 35-kDa 
proteinase and HC-Pro derive from the N-terminal region of the TEV polyprotein. 

The different polyprotein-producing viruses have different proteinases, but all 
are considered to cleave both native and non-native proteins from the respective polyproteins. 

The present invention demonstrates that polyprotein-producing virus vectors 
in general, preferably potyvirus vectors, and most preferably TEV vectors, serve as efficient 
expression vectors. The theoretical yield of protein products encoded by a polyprotein- 
producing vector is extremely high because all virus genome-encoded proteins are synthesized 
in equimolar amounts. 

Introduction of the vector into the host results in production of the non-native 
protein contained on the vector during the course of systemic infection of the recombinant 
virus. The gene encoding the non-native protein is introduced into the virus genome such 
that the non-native protein is produced as part of a polyprotein precursor with other native 
viral proteins. Preferably, the sequence encoding the non-native protein is inserted between 
coding sequences for native proteins of the viral genome. The sequence encoding the non- 
native protein must have flanking cleavage sites to facilitate removal of the non-native 
protein from the polyprotein. The engineering of site-specific cleavage sites recognized by 
the viral proteinases permits proteolytic release of the non-native protein from the 
polyprotein. Techniques commonly used in the art to effect site directed mutagenesis may 
be used. These include, for example, the methods described by Kunkel, T.A., et al., Methods 
Enzymol., 154:367-382 (1982). 

In the case where a plant is used as the host cell, the vector should preferably 
be aphid transmission-defective to prevent transmission of the recombinant vector to other 
plants. This is accomplished by inactivation of viral components necessary for insect 
transmissibility, such as, for example, the HC-Pro proteinase in TEV. Introduction of 
mutations, such as deletions at sites in the insect-transmissible dependent proteins leads to 
transmission-defective vectors. Mutations may be generated by techniques known to those 
skilled in the art. 

The method for generating a vector comprises reverse transcription of a 
polyprotein-producing viral RNA by any reverse transcription method commonly used in the 
art to produce a cDNA. The cDNA is preferably full length, but partial cDNA may also be 
used, as long as the cDNA is able to code for a functional polyprotein. Double-stranded 
DNA (d.s. DNA) is generated from the cDNA using any technique known to those skilled 
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in the art. such as. but not limited to. bacterial DNA polymerase and RNase H. Unique 
restriction endonuclease sites may be added. The d.s. DNA is digested with a suitable 
restriction endonuclease and the digest is inserted into a cloning vehicle (which may have at 
least one unique restriction endonuclease site) such as, but not limited to. a plasmid. cosmid 
or lambda. Resulting expression vectors are propagated in host cells using techniques known 
to those skilled in the art. 

Unique restriction sites are added to the gene encoding the non-native protein 
to be inserted into the vector. These restriction sites may code for cleavage at the N- or C- 
termini of the non-native protein (i.e., at the 5' or 3' end). In one embodiment, restriction 
sites are generated for cleavage by the 35K (PI) protease to maturate the protein. In an 
alternative embodiment, sites for cleavage at the C- terminus by NIa protease may be 
introduced. 

The gene coding for a protein non-native to the viral vector is inserted into a 
specific site on the resulting cloning vehicle using any technique known to those skilled in 
the art for introducing foreign genes into cloning vehicles. The transcripts may be applied 
to recipient cells by any technique known to those skilled in the art. These include, but are 
not limited to, manual application, such as abrasive inoculation, particle bombardment, and/or 
electroporation. 

As used herein, the phrase, "recipient cell" or "host cell" is intended to refer 
to cells that are susceptible to infection by a polyprotein-producing virus. Species from, for 
example, the Solanaceae, Chenopodiaceae, Leguminosae, Amaranthaceae, Compositae, 
Campanulaceae, Scrophulariaceae, Convolvuiaceae, Lobeliaceae families and others which 
are known to be infected by potyvirus may be transformed. For example, Table 1 shows the 
species and Families which are known to be infected by TEV as described by Edwardson, 
J.R., and Christie, R.G., previously incorporated by reference. Additional species may be 
found which are also susceptible to TEV infection. 



TABLE 1. Species Reported to be Infected by Tobacco Etch Virus 


Species 


Families 


Species 


Families 


Alonsoa linearis 


Scrophulariaceae 


N. caudigera 


Solanaceae 


Amaranthus caudatus 


Amaranth aceae 


N. clevelandii 


Solanaceae 


Atriplex hortensis 


Chenopodiaceae 


N. digluta 


Solanaceae 
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Beta vui saris 



Chenopodiaceae 



Brachycome iberidifolia Compositae 



Browallia major 



Solanaceae 



B. speciosa 



Solanaceae 



Callistephus chinensis Compositae 



Campanula drabifolia Campanulaceae 
Capsicum annuum 



C. frutescens 



C. microcarpum 



Solanaceae 
Solanaceae 



Solanaceae 



C. pendulum 



Solanaceae 



Cassia occidentalis 



Leguminosae 



C. tora 



Leguminosae 



Celosia areentia 



Amaranthaceae 



C. foetidum 



Chenopodiaceae 



C. quinoa 



Chenopodiaceae 



Cirsium vulgare 



Compositae 



Coliinsia bicolor 



Scrophulariaceae 



C. heterophylla 



Scrophulariaceae 



Cuscuta californica 



Convolvulaceae 



C. lupulifornis 



Convolvulaceae 



Cymbalaria muralis 



Scrophulariaceae 



Datura ferox 



Solanaceae 



D. metel 



Solanaceae 



D. meteloides 



Solanaceae 



D. stramonium 



Solanaceae 



Dimorphotheca 
aurantiaca 



Compositae 



D. pluvialis 



Compositae 



D. sinuata 



Compositae 



Emmenanthe 
penduliflora 



Hydrophyilaceae 



N. x edwardsonii 



N. glauca 



N. glutinosa 



K langsdorfii 



N. longiflora 



N. nudicaulis 



N. paniculata 



N. pauciflora 



N. plumbaginifolia 



N. raimondii 



N. repanda 



N. rustica 



Charieis heterophylla | Com positae 
Chenopodium album [ Chen opodiaceae 
C. amaranticolor | Chenopodiaceae 



N. sanderae 



N. solanifolia 



N. svivestris 



N. tabacum 



N. tomentosa 



N. trigonophylla 



N. undulata 



Niermbergia hippomanica 



Nolana lanceolata 



Penstemon grandiflorus 



Solanaceae 



Solanaceae 



Solanaceae 



Solanaceae 



Solanaceae 



Solanaceae 



Solanaceae 



Solanaceae 



Solanaceae 



Solanaceae 



Solanaceae 



Solanaceae 



Solanaceae 



Solanaceae 



Solanaceae 



Solanaceae 



Solanaceae 



Solanaceae 



Solanaceae 



Solanaceae 



Solanaceae 



Petunia hybrida 



P. violaceae 



Phacelia campanularia 



P. ciliata 



P. grandiflora 



P. viscida 



P. whitiavia 



Physalis alkekengi 



P. ansulata 



P. ciliosa 



Scrophulariaceae 



Solanaceae 



Solanaceae 



Hydrophyilaceae 



Hydrophyilaceae 



Hydrophyilaceae 



Hydrophyilaceae 



Hydrophyilaceae 



Solanaceae 



Solanaceae 



Solanaceae 
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3D 



Eupatorium iasseauxii 


Compositae 


P. floridana 


Solanaceae 


Gamolepis tagetes 


Compositae 


P. heterophylla 


Solanaceae 


Gomphrena globosa 


Amaranthaceae 


P. ixocarpa 


Solanaceae 


Gypsophila elegans 


Caryophyllaceae 


P. peruviana 


Solanaceae 


Helianthus annuus 


Compositae 


P. pruinosa 


Solanaceae 


Helipterum 
humboldtianum 


Compositae 


P. pubescens 


Solanaceae 


Hyoscyamus niger 


Solanaceae 


P. subglabrata 


Solanaceae 


Indigofera hirsuta 


Leguminosae 


P I an tag o lan ceo lata 


Plantaginaceae 


Lamium amplexicaule 


Labiatae 


Portulaca oleracea 


Portulacaceae 


Linaria bipartita 


Scrophulariaceae 


Primula malacoides 


Primulaceae 


L. canadensis 


Scrophulariaceae 


Proboscidea jussieui 


Martynaceae 


L. maroccana 


Scrophulariaceae 


Rudbekia amplexicaulis 


Compositae 


Lobelia erinus 


Lobeliaceae 


Salpiglosis sinuata 


Solanaceae 


L. gracilis 


Lobeliaceae 


Schizanthus pinnatus 


Solanaceae 


L. hederacea 


Lobeliaceae 


Senecio vulgaris 


Compositae 


L. inflata 


Lobeliaceae 


Silene angelica 


Caryophyllaceae 


L. tenuior 


Lobeliaceae 


Solanum aculeatissimum 


Solanaceae 


Lycium chinensis 


Solanaceae 


S. capsicastrum 


Solanaceae 


Lycopersicon 
esculentum 


Solanaceae 


S. carolinense 


Solanaceae 


L. hirsutum 


Solanaceae 


S. elaegnifoiium 


Solanaceae 


L. peruvianum 


Solanaceae 


S. integrifolium 


Solanaceae 


L. pimpinellifolium 


Solanaceae 


S. melongena 


Solanaceae 


Margaranthus 
solanaceus 


Solanaceae 


S. nigrum 


Solanaceae 


Melilotus albus 


Leguminosae 


S. pseudocapsicum 


Solanaceae 


M. indicus 


Leguminosae 


S. seaforthianum 


Solanaceae 


M. italicus 


Leguminosae 


S. tuberosum 


Solanaceae 


M. messanensis 


Leguminosae 


Spinacia oleracea 


Chenopodiaceae 


M. officinalis 


Leguminosae 


Tetragonia expansa 


Aizoaceae 


M. sulcatus 


Leguminosae 


Torenia fournieri 


Scrophulariaceae 


M. wolgicus 


Leguminosae 


Trigonella calliceras 


Leguminosae 


Mollugo verticillata 


Aizoaceae 


T. coerulea 


leguminosae 


Nemophilia insignis 


Hydrophyllaceae 


T. cornicutata 


Leguminosae 
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N. maculata 


Hydrophyllaceae 


*T. cretica 


L_ca;urninosae 


N. merziesii 


Hydrophyllaceae 


T. foenum-eraecum 


I Pffn m inncao 


Nicandra physaioides 


Solanaceae 


Valerianella locusta 


» aicicuiaCCac 


Nicotiana acuminata 


Solanaceae 


Verbena canadensis 


Verbenaceae 


N. alata 


Solanaceae 


V. hybrida 


Verbenaceae 


N. benthamiana 


Solanaceae 


Zaluzianskya viliosa 


Scrophulariaceae 


N. bigelovii 


Solanaceae 


Zinnia elegans 


Compositae 


N. bonariensis 


Solanaceae 
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Host cells may be in plants, plant cell cultures, animals or animal cell cultures. Cell cultures 
include any conventional in vitro cell culture such as, but not limited to, roller bottle, agar, 
and bioreactor techniques. The host cell may be a plant or an animal cell. It is anticipated 
that vectors having plant polyprotein-producing viral RNA will infect susceptible plant cells, 
and that vectors having animal polyprotein-producing viral RNA will infect susceptible 
animal cells, although some plant host cells may be susceptible to animal polyprotein- 
producing viral RNA, and some animal host cells may be susceptible to plant polyprotein- 
producing viral RNA. "Cell" as used herein may be a single cell, a plurality of cells, or an 
organism. 

In order to improve the ability to identify transformants, one may desire to 
employ selectable or screenable marker gene as, or in addition to, the expressible gene of 
interest. Such genes may encode either a selectable or screenable marker, depending on 
whether the marker confers a trait which one can select for by chemical means, i.e., through 
the use of a selective agent such as an herbicide, antibiotic or the like, or whether it is simply 
a trait that one can identify through observation or testing (e.g., the R-locus trait). Many 
examples of suitable marker genes are known to the art and can be employed in the practice 
of the present invention. 
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Possible selectable markers for use in connection with the present invention 
include, but are not limited to. a mo gene which codes for kanamycin resistance and can be 
selected for using kanamycin, G418, etc.; a bar gene which codes for bialaphos resistance, 
a mutant EPSP synthase gene which encodes glyphosate resistance; etc. Exemplary 
5 screenable markers include beta-glucruonidase (GUS), or an R-locus gene, which encodes a 

product that regulates the production of anthocyanin pigments (red color) in host cells. 
Included within the terms "selectable" or "screenable marker" genes are also genes which 
encode a secretable marker whose secretion can be detected as a means of identifying or 
selecting for transformed host cells. Examples include markers which are able to secrete 
10 antigen(s) that can be identified by antibody interaction, or an enzyme(s) which can be 

detected catalytically. 

The choice of the particular foreign DNA segments to be delivered to the 
recipient host cells will often depend on the purpose of the transformation. 

The vector is used to generate proteins of interest that are not native to the 
15 virus in a host cell. This includes the production of important proteins or other products for 

commercial use, such as lipase, melanin, pigments, antibodies, hormones, pharmaceuticals 
such as, but not limited to, interleukins, EPO, G-CSF, GM-CSF, hPG-CSF, M-CSF. Factor 
VIII, Factor IX, tPA, hGH, receptors, insulin, vaccines, antibiotics and the like. The coding 
sequences for proteins that can be used are known in the art or can be obtained by standard 
20 sequencing techniques. Alternatively, the vector may be used to produce an enzyme that is 

able to convert a natural product to a unique product. This includes, for example, the 
production of secondary metabolites useful as pharmaceuticals. Alternatively, the vector may 
be used to produce degradative or inhibitory enzymes. 

In an alternative embodiment, the vector is used for the transformation of crop 
25 plants to add some commercially desirable, agronomically important traits to the plant. Such 
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traits include, but are not limited to. herbicide resistance, increased yields, insect and disease 
resistance, physical appearance, food content and makeup, etc. For example, one may desire 
to incorporate one or more genes encoding herbicide resistance. The bar and glyphosate 
tolerant EPSP synthase genes are good examples. A potential insect resistance gene which 
can be introduced includes the Bacillus thuringiensis crystal toxin gene, which may provide 
resistance to pests such as lepidopteran or coleopteran. 

Genes encoding proteins characterized as having potential insecticidal activity, 
such as the cowpea trypsin inhibitor (CpTI); may find use as a rootworm deterrent; genes 
encoding avermectin may prove particularly useful as a corn rootworm deterrent. 
Furthermore, genes encoding lectins may confer insecticide properties (e.g., barley, wheat 
germ agglutinin, rice lectins) while others may confer antifungal properties (e.g., hevein or 
chitinase). 

The following examples illustrate the teachings of the present invention and 
are not intended to limit the scope of the invention. In particular, although the following 
examples are limited to plant viruses and plant hosts, the present invention also encompasses 
recombinant picoma-like animal virus vectors and the animal hosts injected by the animal 



virus. 
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Example 1 

This example demonstrates the production of a potyvirus vector containing a 
GUS marker, as also described in Dolja, V.V., et al.. Proc. Natl Acad. Sci.. 89: 10208- 
10212 (1992), incorporated by reference herein. 
5 A DNA copy of TEV RNA (ATCC PV-69) was synthesized with reverse 

transcriptase (Superscript from GIBCO-BRL) and a primer complementary to 23 nucleotides 
(nt) preceding the 3'-terminal poly(A) tail of the virus genome. Double-stranded DNA was 
generated using Escherichia coli DNA polymerase I and RNase H, digested with 5srEII, and 
inserted into &rtEII-digested pTL7SN.3-0027, which contained sequences representing the 

10 5' and 3' ends of the TEV genome. A unique Bgl II site was engineered immediately after 

the 25-nt 3* poly(A) tail by site-directed mutagenesis as described by KunkeL T.A., et al, 
Methods Enzymol, 154:367-382 (1982), incorporated by reference herein. A longer poly(A) 
tail (70-75 nt) was incorporated from pBB5995, resulting in pTEV7D. Plasmids were 
propagated in E. coli strain HB101. Figure 1 depicts diagrams of portions of plasmids 

15 containing cDNA representing the complete TEV genome and inserted GUS gene. The 

noncoding (open boxes) and coding (stippled shading) regions of the TEV genome, the GUS 
gene (hatched shading), and SP6 RNA polymerase promoter (black arrow) are shown. 
Vertical lines below maps indicate sequences coding for proteolytic processing sites, whereas 
selected restriction sites are indicated above maps. Abbreviations as used in this and other 

20 Figures are: N, Nco I; Bs, BstEll: Bg, 5g/II; 35, 35-kDa proteinase; HC, HC-Pro; 50. 50-kDa 

TEV protein; CI, cylindrical inclusion protein; 6, 6-kDa TEV protein; NIa. nuclear inclusion 
protein a; Nib, nuclear inclusion protein b; Cap, capsid protein; 7D, pTEV7D. 

Site-directed mutagenesis was used to introduce Nco I sites flanking the 5' and 
3' terminal coding sequences of the GUS gene in pTL7SN.3-GUS. The GUS coding region 

25 was excised with Nco I and inserted into the Nco I site that was introduced near the 
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beginning of the HC-Pro coding sequence in pTL7SN.3-0627. The GUS gene and adjacent 
TEV coding sequences from this plasmid were incorporated into pTEV7D. resulting in 
pTEV7D-GUS.HC (Fig. 1). The GUS gene was also inserted into the Nco I site that had 
been introduced previously at the beginning of the 35-kDa protein coding region in 
P TL7SN.3-0027. The GUS and adjacent TEV sequences from the resulting plasmid were 
introduced into pTEV7D, yielding pTEV7D-GUS.Pl (Fig. 1). 

Transcripts that were capped with 7-methylguanosine(5 , )triphospho(5') 
guanosine were synthesized using bacteriophage SP6 RNA polymerase and cesium chloride- 
purified, Bgl II-linearized plasmid DNA as described by Carrington, J.C., et ai, J. Virol.. 
64:1590-1597 (1990) , incorporated by reference herein. The 5' ends of transcripts from 
pTEV7D and related plasmids contained two additional nonviral nucleotides. Transcription 
mixtures were diluted with equal volumes (vol) of 10 mM sodium phosphate buffer, pH 7.4, 
and applied manually onto young tobacco plants (10 ul per leaf, two leaves per plant) with 
the aid of carborundum. In passage experiments, infected leaves were ground in 10 vol of 
10 mM sodium phosphate buffer containing carborundum and applied manually to leaves of 
healthy plants with a cotton swab. 

Fluorometric assays for GUS activity and measurements of protein 
concentration were conducted as described by Carrington, J.C., et al.,J. Virol., 64: 1 590-1597 
(1990), previously incorporated by reference herein. In situ GUS assays were done by using 
a colorimetric substrate according to Restrepo et al. Plant Cell, 2:987-998 (1990). 
incorporated by reference herein. Tobacco leaves were vacuum infiltrated with the substrate 
5-bromo-4-chloro-3-indolyI p-D-glucuronic acid, cyclohexylammonium salt (X-gluc) (1.2 
mM) in 0.5 mM potassium ferricyanide/10 mM EDTA. Manual cross-sections of leaf 
petioles, stems, and roots were placed directly into the substrate solution and photographed 
under bright-field optics by using a Zeiss photomicroscope. Total SDS-soluble proteins were 
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extracted from leaf tissue by grinding in 7 vol of protein dissociation buffer (0.625 M 
Tris-HCl. pH 6.8/2% SDS/10% 2-mercaptoethanol/10% (vol/vol) glycerol) and subjected to 
immunoblot analysis with anti-HC-Pro. anti-capsid, or anti-GUS sera by described 
procedures. 

5 Approximately 27% (15 of 56) of tobacco plants inoculated with capped RNA 

transcripts derived from pTEV7D became infected. Using the PCR, RNA isolated from 
progeny virions was shown to contain the sequence corresponding to the Nco I site 
engineered at the start codon for TEV polyprotein in pTEV7D, demonstrating the transcript- 
derived origin of infection. Twenty-five percent (16 of 64) and 0% (0 of 16) of plants 

10 inoculated with pTEV7D-GUS.HC and pTEV7D-GUS.Pl transcripts, respectively, became 

infected. Due to the absence of infectivity, pTEV7D-GUS.Pl was not used further. Virion 
RNA from pTEV7D-GUS.HC transcript-inoculated plants exhibited decreased electrophoretic 
mobility compared with RNA from wild-type TEV, suggesting that the GUS gene was 
retained in progeny virus. Plants infected by the modified virus, which will be referred to 

15 as TEV-GUS, lacked the vein clearing and etching typical of plants infected by wild-type 

TEV. 

The GUS gene in pTEV7D-GUS.HC was inserted adjacent to the coding 
sequence for the 35-kDa proteinase autoproteolytic cleavage site at Tyr-304/Ser-305 between 
the 35-kDa proteinase and HC-Pro. Insertion of GUS into the polyprotein at this position 
20 has been shown not to interfere with the 35-kDa proteinase processing activity. Proteolysis 

by 35-kDa proteinase and HC-Pro at their respective C- termini was predicted to yield a 
GUS-HC-Pro fusion protein of approximately 119 kDa. 

Figure 2 depicts the immunoblot analysis of extracts from RNA transcript- 
inoculated tobacco plants. Extracts from pTEV7D (7D; lanes 6, 10, and 14), two pTEV7D- 
25 GUS.HC (lanes 4, 5, 8. 9, 12, 13)-infected plants, and a mock-inoculated plant (M: lanes 7. 
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11, and 15) were analyzed by using anti-capsid protein. anti-HC-Pro. or anti-GUS serum. 
Lanes 4-15 each contained about 50 ug of SDS-soluble protein. A capsid protein 
concentration series (capsid standard: Cap. std.) consisting of 1.0 ug, 0.1 ug, and 0.01 ug 
(lanes 1-3. respectively) was also analyzed. The molecular masses (in kDa) of capsid protein 
(30 kDa). HC-Pro (52 kDa), and the GUS-HC-Pro fusion protein (1 19 kDa) are located at 
left. The minor immunoreactive bands in lanes 8-20, 12, and 13 represent degradation 
products. Plants inoculated with transcripts from pTEV7D, on the other hand, contained 
HC-Pro of normal size (52 kDa) that reacted only with anti-HC-Pro serum (lanes 10 and 14). 
Immunoblot analysis using anti-capsid serum indicated that the levels of capsid protein in 
plants inoculated with pTEV7D-GUS.HC and P TEV7D transcripts were similar (lanes 4-6). 
By comparing the level of immunoreactive protein in preparations of known protein 
concentration against a standard series, capsid protein was shown to account for about 1% 
of total protein in extracts from plants inoculated with transcripts from either plasmid (lanes 
1-6). Immunoblot analysis revealed accumulation of an approximately 1 19-kDa product that 
reacted with both anti-HC-Pro and anti-GUS sera (Fig. 2, lanes 8, 9, 12, and 13). 

The stability of the GUS insert within the TEV-GUS genome was assayed by 
immunoblot analysis in plant-to-plant passage experiments using virus originally recovered 
from two plants (plant 3 and 7) infected with RNA transcripts. All passages were performed 
4-6 days postinoculation (p.i.), except passage II (16 days p.i.). The 1 19-kDa GUS-HC-Pro 
protein was intact after seven passages of virus from plant 3 (Fig. 3, lanes 2, 4, 6, 8, and 1 1), 
suggesting stable retention of the GUS sequence. In Figure 3, samples of TEV-GUS were 
prepared from plants representing two independent virus lineages infected with pTEV7D- 
GUS.HC RNA transcripts and were analyzed using anti-HC-Pro serum. Extracts from TEV- 
GUS passages II-VII (lanes 1-11), pTEV7D transcript-inoculated (7D; lane 12), and mock- 
inoculated (M; lane 13) plants are shown in Figure 3. By use of a fluorometric assay. GUS 
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activity levels were determined to be comparable in plants from each passage, as well as from 
an additional seven passages. In contrast, TEV-GUS from plant 7 sustained a deletion in the 
GUS-HC-Pro coding region that was evident during passage IV. resulting in appearance of 
an anti-HC-Pro reactive protein that was about 7 kDa smaller than intact HC-Pro (lanes 7, 
5 9, and 10). This form was also detected in the remaining three passages (lanes 1, 3, and 5). 

The disappearance of the GUS-HC-Pro protein correlated with a loss of GUS activity, as well 
as a reversion to a mosaic- and etch-inducing phenotype, in plants from passages V-VII. 

The stability of TEV-GUS as a function of time after inoculation was tested 
by using virus from passage line 3. Upper systemic leaves from passage V and VI plants 
10 were screened for GUS-HC-Pro fusion protein by immunoblot analysis and for GUS activity 

by using a fluorometric assay at intervals after inoculation. Figure 4 depicts analysis of 
GUS-HC-Pro fusion protein and GUS activity in aging plants infected by TEV-GUS. In 
Figure 4(A), extracts were prepared at the times p.i. indicated from upper systemic leaves of 
passage V and VI plants infected by TEV-GUS line 3 and were subjected to immunoblot 
15 analysis with anti-HC-Pro serum (lanes 1-6). Extracts were also analyzed from pTEV7D 

(7D; lane 7) and mock-inoculated (M; lane 8) plants. In Figure 4(B), GUS activities as 
determined by fluorometric assay with extracts analyzed in panel A are shown. Deletions 
that resulted in truncated fusion proteins were detected 20 and 25 days p.i. in passage VI and 
V plants, respectively (Fig. 4A, lanes 2 and 5, respectively). Additional deletion variants 
20 encoding HC-Pro-related proteins smaller than wild-type HC-Pro were evident after an 

additional 5 days in each plant (lanes 1, 4, and 7). The appearance of deleted forms 
correlated with decreased amounts of both the intact 119-kDa fusion protein (compare lanes 
1 and 2 with lane 3, and lanes 4 and 5 with lane 6) and GUS activity (Fig. 45). 

Plants infected by TEV-GUS were identified in situ localization of GUS 
25 activity. Whole leaves were vacuum-infiltrated with the histochemical GUS substrate X-gluc 
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at various times. Plants were innoculated with TEV-GUS at 100 ug/ml. Young tobacco 
plants were mechanically inoculated with TEV-GUS. and leaves were excised and vacuum- 
infiltrated with GUS substrate X-gluc at several time points post-inoculation (p.i.). 
Macroscopic and microscopic visualization of the leaves demonstrated GUS activity. Single 
infected epidermal cells that contained indigo GUS reaction product were identified 12 hr p.i. 
on primary inoculated leaves. By 24 hr p.i., movement to adjacent epidermal and mesophyll 
cells had resulted in infection foci that extended to approximately 10 cells in diameter and 
that were visible by eye. These foci continued to expand and eventually fused by 96 hr p.i.. 
Measurements of cell-to-cell movement over time indicated that focus expansion occurred at 
a rate of approximately one cell per 2 hr. Activity of TEV-GUS was detected around 
segments of vascular tissue in systemically (noninoculated) infected leaves by 60 hr p.i.. 
After 72 hr p.i., virus movement was evident along major and minor veins and into leaf 
mesophyll cells adjacent to vascular tissue. Spread of virus through mesophyll tissue between 
veins proceeded at a rate comparable to that measured on inoculated leaves. 

To determine which cells or cell types were infected first during systemic 
spread cross-sections from stems, roots, and leaf petioles above the site of inoculation were 
incubated in X-gluc and visualized by light microscopy. This was conducted by /„ situ 
localization of GUS activity in cross-sections of petioles, stems, and roots of plants infected 
by TEV-GUS. Sections were cut by hand at various times p.i. (hr) and incubated with the 
histochemical substrate X-gluc. No TEV-GUS activity was detected at 24 and 48 hr p.i. in 
any of the organs. By 72 hr p.i., however, clusters of phloem-associated cells in each of the 
three organs exhibited activity. Nonphloem-associated cell types, such as xylem parenchyma 
and cortex, were free of detectable TEV-GUS. Ingress into these cell types was evident after 
96 hr p.i.. Strikingly, tissues of lateral roots contained especially high levels of activity. 
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The TEV-GUS system demonstrated that potyviruses have utility as replicating 
vectors for the introduction and expression of foreign genes in plants. For TEV. capsid 
protein accounts for about 1% of SDS-soluble protein in infected leaves (Fig. 2). Unlike 
most other plant RNA virus vectors, the potyvirus-based system permitted both efficient 
systemic spread and high insertion capacity. 

Example 2 

This example shows the spontaneous mutagenesis of a plant poty virus genome 
after insertion of a foreign gene, as described in Doija, V.V. et aL, J. Virol. 67: 5968-5875 
(1993), incorporated by reference herein. 

Routine mechanical inoculation of young Nicotiana tabacum cv. Xanthi nc 
plants with wild-type TEV-HAT (high aphid transmissibility and TEV mutants was 
performed with the aid of carborundum using homogenates prepared by grinding infected 
leaves in 2 vol of 10 mM Tris-HCl, 1 mM EDTA, pH 7.6. Homogenates were aliquoted, 
frozen at -85° C, and thawed prior to inoculation as needed. In some cases, crude virus 
preparations from individual infected plants were obtained using the same method, but with 
exclusion of the CsCl-gradient step. Viral RNA was purified using proteinase K digestion, 
phenol extraction, and ethanol precipitation as described by Carrington, J.C., et al. 9 Virology, 
139:22-31 (1984), incorporated by reference herein. 

Complementary DNA spanning the region of deletion in the GUS-HC-Pro 
coding sequence was obtained by reverse transcription of RNA from crude virus preparations 
followed by PCR. Reverse transcription and PCR were conducted according to established 
procedures as described by Kawasaki, E.S., In PCR Protocols, pp. 21-27 [Academic Press, 
Inc., San Diego (1990)], incorporated by reference herein, except that reverse transcriptase 
buffer from Gibco/BRL was used, rather than PCR buffer in the initial step. The first-strand 
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primer was complementary to TEV nucleotides 1456-1477 within the HC-Pro coding region 
while the second-strand primer corresponded to nucleotides 781-799 within the PI region. 
The first and second-strand primers contained Hindlll and BamHl recognition sequences, 
respectively, to facilitate insertion into the vector pTL7SN. All plasmids were propagated 
in E. coli strain HB101. The sequence of the insert DNA spanning the deletion endpoints 
in recombinant plasmids was determined using the Sequenase R kit (U.S. Biochemicals) and 
a primer corresponding to TEV nucleotides 1015-1032. Each spontaneous deletion variant 
identified was assigned a code (ldel, 2del. etc.). 

The sequences between nucleotides 663 to 2416 in the genomes of deletion 
variants TEV-2del and TEV-7del were reverse transcribed, amplified by PCR. and digested 
with SnaBl and EcoRI which recognize sites within the PI and HC-Pro coding regions, 
respectively. The cleaved PCR fragments were cloned into SnaBl and £o>RI-digested 
pTL7SN.3-0027DA, which contains sequences corresponding to the 5'-terminal 2681 and 3'- 
terminal 154 nucleotides of TEV genome. This plasmid was derived from pTL7SN.3-0027D 
by removal of two nonviral nucleotides separating the 5'-terminus of the TEV sequence from 
the start site of SP6 RNA polymerase transcription. The resulting plasmids, therefore, 
possessed the sequences surrounding the deletion breakpoints that were present in TEV-2del 
and TEV-7del. To generate full-length copies of TEV genomes containing these deletions, 
the BstEU-BstEU fragment corresponding to TEV nucleotides 1430-9461 was isolated from 
pTEV7D, and was inserted into 5*/EII-digested derivatives of pTL7SN.3-0027DA. Figure 
5 depicts genetic maps of TEV and selected TEV variants used in this study. Figure 5(A) 
shows wild-type TEV-HAT as derived from transcription of pTEV7DA. The single open 
reading frame is indicated by the extended, light gray box. Positions encoding polyprotein 
cleavage sites are shown by the vertical lines. The names of individual proteins are shown 
above the map. Figure 5(B) depicts TEV-GUS and deletion variants. TEV-GUS differs 
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from TEV-HAT in that the former contains the coding region for ^-glucuronidase (GUS) 
inserted between the PI and HC-Pro sequences. The polyprotein encoded by TEV-GUS 
contains a functional PI cleavage site between PI and GUS. resulting in formation of a GUS- 
HC-Pro fusion protein. The sequences missing in the deletion variants of TEV-GUS are 
5 shown in the expanded diagram. The numbers indicate the nucleotide positions of the 

deletion endpoints in the GUS and HC-Pro coding sequences. Abbreviations as used in this 
and other Figures are: HC-Pro ? helper component-proteinase; CI, cylindrical inclusion protein; 
NIa, nuclear inclusion protein ; a'; Nib, nuclear inclusion protein 'b ¥ ; Cap, capsid protein. 
These plasmids were named pTEV7DA-2del f and pTEV7DA-7del r (Fig. 5B). Viruses 

10 derived from transcripts of these plasmids were designated TEV-2del r and TEV-7del f (the 

V subscript indicates reconstructed virus). As a control, the same manipulations (reverse 
transcription, PCR and subcloning) were performed with the wild-type TEV-HAT genome, 
resulting in plasmid pTEV7DA f . 

The deletion mutant TEV-2del (and 2del r ) retained nucleotides 1-135 from 

15 GUS but lost HC-Pro sequence up to nucleotide 207 (Fig. 5B). Two additional deletion 

variants were generated by site-directed "loop out" mutagenesis of pTEV7DA-2del r The 
GUS sequence between nucleotides 10-135 was removed, resulting in pTEV7DA-AGUS. 
The HC-Pro deletion in pTEV7DA-2del r was enlarged by removal of sequence corresponding 
to HC-Pro nucleotides 208-265, yielding pTEV7DA-AHC (Fig. 5B). 

20 RNA transcripts capped with m 7 GpppG were synthesized using bacteriophage 

SP6 RNA polymerase (Ambion) and CsCl gradient-purified, 5g/II-Iinearized plasmid DNA 
as described by Carrington, J.C., et al, J. Virol, 64:1590-1597 (1990), previously 
incorporated by reference herein. The undiluted transcription mixtures were applied manually 
onto leaves of young tobacco plants (10(il per leaf, two leaves per plant) that were dusted 

25 with carborundum. 
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Total SDS-soluble proteins were extracted from leaf tissue by homogenization 
in 4 vol of protein dissociation buffer (0.625 M Tris-HCl. pH 6.8, 2% SDS. 10% 2- 
mercaptoethanol. 10% glycerol) and subjected to immunoblot analysis with anti-capsid or 
anti-HC-Pro sera. Quantitation of capsid protein levels in leaf extracts was done using capsid 
protein standards from purified virus and densitometry of immunoblots with a Model 620 
Video Densitometer (Bio-Rad). 

Isolation of total RNA from leaf tissue was as described previously by 
Carrington, J.C., et al., Virology 139:22-31 (1984), previously incorporated by reference 
herein. RNA (5 ug) was denatured by glyoxal treatment, separated by electrophoresis 
through a 1% agarose gel, and blotted onto a GeneScreenP/us™ membrane (DuPont). 
Northern hybridization was performed using a randomly-primed, 32 P-labeled probe prepared 
from a DNA fragment corresponding to TEV nucleotides 1430-9461. The amount of 
radioactivity bound to each lane was measured using a Beckman LS 5801 Scintillation 
Counter. 

For transmission directly from infected leaves, aphids (Myzus persicae Sulz) 
were allowed a 5-10 min acquisition access period and then transferred to tobacco {Nicotiana 
tabacum cv. Ky 14) seedlings. Ten aphids were placed on each often test plants for each 
treatment in each experiment. In helper component prefeeding experiments, aphids were first 
allowed a 5-10 min acquisition access period to a concentrated preparation of potato virus 
Y helper component through a parafilm membrane. Aphids were then transferred to a TEV- 
infected leaf and processed as described above. Infection of test plants was scored visually. 

As demonstrated previously in Example 1, the transfer of TEV-GUS from 
plant-to-plant at intervals of four to six days resulted in stable retention of the GUS insert 
over 7 passages. Continuation of this experiment up to 21 passages during approximately 
120 days further demonstrated the stability of the foreign insert under these conditions. 
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Figure 6 depicts immunoblot analysis of extracts from upper leaves of TE V-GUS-inoculated 
tobacco plants using anti-HC-Pro sera. Each lane contains a sample from an individual plant 
inoculated with TEV-GUS. TEV-HAT, or buffer (lane M). All plants were propagated 4-5 
weeks post-inoculation except the plant labeled TEV-GUS, which was propagated for 6 days 
post-inoculation. Each of the samples in lanes 1-12 contained a TEV-GUS-derived. 
spontaneous deletion variant that encoded a truncated GUS-HC-Pro fusion protein. Lanes 
4 and 3 contained extracts from plants representing first and second passages. respectively ; 
that derived from a single TEV-GUS-infected plant. Each lane was loaded with equivalent 
amounts of extract. The weak appearance of HC-Pro-related proteins on lanes 1, 5. 7, and 
12 was due to their low yield or rapid turnover. The molecular weights in kilodaltons (kDa) 
of wild-type HC-Pro and GUS-HC-Pro fusion protein are shown in the Figure. Immunoblot 
analysis of total protein from the 21st passage plant revealed the full-size, GUS-HC-Pro 
fusion product (Fig. 6). The level of GUS activity in this plant was comparable to that in 
plants from the early passages. In contrast, prolonged propagation (3 to 4 weeks) of virtually 
all TEV-GUS infected plants led to deletion mutants encoding HC-Pro-related proteins that 
were considerably smaller than GUS-HC-Pro (Fig, 6, lanes 1-12). Several of these variants, 
all of which were found in systemically infected leaves, expressed HC-Pro forms that were 
smaller than the wild-type protein (52-kDa), indicating that these mutants contained deletions 
of GUS and HC-Pro sequences. In some cases, apparent intermediate deletion mutants were 
identified that, upon further passage, were replaced by variants containing larger deletions 
(Fig. 6, compare lanes 4 and 3). In control plants infected for five weeks by wild-type TEV- 
HAT, no deletions of HC-Pro were evident (Fig. 6), suggesting that the deletion phenomenon 
was specific to TEV carrying the foreign gene. 

Virions and corresponding RNA were purified from six individual plants 
infected by TEV-GUS deletion variants expressing HC-Pro-related proteins of various sizes. 



- 24 - 



BNSDOCID: <WO 9512669A1 J_> 



WO 95/12669 



PCTYUS94/12836 



The regions overlapping the deletion sites were copied by reverse transcription, amplified by 
PCR. cloned in a plasmid, and sequenced. Analysis of 40 cDNA clones revealed 9 different 
deletion variants (Table 2). 
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TABLE 2. 


Summary of TEV-GUS deletion mutants. 


Deletion 
p j a 


Deletion 
lengm 


Deletion 5' end 
within GUS b 


Deletion 3' end 
within HC-Pro c 


Number of 

cDNA clones 


1 An\ 


1 787 
1 / o / 


31(G) 


no deletion 


5 


zael 




135(T) 


207(G) 


1 


_>aei 


i 5Z5 


80(A) 


90(A) 


1 


4aei 




66(T) 


187(G) 


9 


Duel 


1 7/11 


92(A) 


15(C) 




6del 


1789 


81(T) 


52(T) 


2 


7del 


2074 


9(T) 


265(C) 


10 


8del 


1804 


91(G) 


77(C) 


8 


9del 


1840 


84(G) 


106(A) 


1 


a Variants 4del, 5del, and 6del were isolated from the same plant, as were 
variants 8del and 9del. All other variants were from individual plants. Note: 
the numbering of deletions does not correspond to the numbering of lanes in 
Fig. 6. 


b Numbering starts with the first nucleotide of the GUS coding sequence. The 
last GUS-derived nucleotide is shown in parentheses. 


c Numbering starts with the first nucleotide of the HC-Pro coding sequence, 
which corresponds to TEV genome nucleotide 1057. The first HC-Pro- 
derived nucleotide is shown in parentheses. 


^ Number of plasmids sequenced after cloning of PCR products. 



The deletions ranged from 1741 to 2074 nucleotides in length, showing almost complete loss 
of the GUS gene in all mutants. Each mutant, however, retained a short segment of between 
25 9 and 135 nucleotides from the 5' end of the GUS sequence. All but one of the variants also 

was lacking between 15 and 265 nucleotides from the 5' terminal region of the HC-Pro 
coding sequence. Interestingly, in no case were the 5' and 3' deletion endpoints entirely 
within the GUS coding region. All deletions that were mapped resulted in preservation of 
the open reading frame, and in some examples, led to the formation of new hybrid codons 
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at the junction sites. Two of the six plants analyzed contained multiple deletion mutants, 
although in both instances one type predominated among the population recovered (Table 2). 
No mutants contained multiple deletions, nor were non-GUS or non-HC-Pro sequences 
located at the junctions, suggesting that the deletions were generated by removal of 
5 contiguous genome segments. 

Figure 7 depicts relative positions of deletion endpoints within the GUS and 
HC-Pro coding sequences in spontaneous mutants of TEV-GUS. Each arrow indicates a 
unique endpoint identified by sequence analysis. The relative positions of the 5' and 3' 
deletion endpoints are presented diagrammatically in Fig. 7. All of the 5' endpoints clustered 

1 0 within the 130 nucleotide region of GUS gene, whereas the 3' endpoints all mapped to the 

first 265 nucleotides of the HC-Pro sequence. Alignment of sequences flanking the endpoints 
of deletions failed to reveal any common elements of primary structure. In fact, the extent 
of homology of sequences surrounding the junction sites did not exceed two nucleotides in 
any of the mutants. Similarly, no nucleotide preferences at the deletion endpoints were 

1 5 evident (Table 2). 

The fact that the deletion mutants were able to spread systemically suggested 
that the HC-Pro region lacking in the mutants was not essential for virus replication and 
intraplant movement. Since it was possible that these plants also contained a low level of 
nondeleted TEV-GUS providing functions necessary for these processes, uniform virus 
populations were prepared for the two variants (2del and 7del) containing the largest HC-Pro 
deletions. An approximately 350 nucleotide segment covering the deletion site in both 
genomes was amplified by PCR and inserted into pTEV7DA, generating pTEV7DA-2del r and 
pTEV7DA-7del r . The resulting constructs harbored the deletion in a wild-type TEV-HAT 
background and lacked any other sequence changes potentially present in the original 



20 
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mutants. As a control, the corresponding genome segment from wild-type TEV-HAT was 
amplified and cloned in pTEV7DA to produce pTEV7DA r 

Between 60 and 100% of plants inoculated with synthetic transcripts from 
pTEV7DA r pTEV7DA-2del r and pTEV7DA-7del r became infected systemically. 

5 demonstrating conclusively that the mutants were capable of replication and movement 

independent of a non-deleted TEV-GUS. The appearance of systemic symptoms at four days 
post-inoculation was observed in plants infected by TEV-HAT and the deletion mutants. 
Systemic symptoms in plants inoculated with TEV-GUS, on the other hand, required five 
days post-inoculation. Figure 8 depicts immunoblot analysis with anti-HC-Pro serum of 

10 extracts from plants inoculated with synthetic RNA transcripts. Figure 8(A) depicts extracts 

which were from systemically infected leaves of three plants inoculated with transcripts from 
pTEV7DA-HAT r (lanes 1-3) and three plants inoculated with transcripts from pTEV7DA- 
7del r (lanes 4-6). Figure 8(B) shows extracts from a mock-inoculated plant (lane 1), plants 
inoculated with virus preparations of wild-type TEV-HAT (lane 2) and TEV-2del (lane 6), 

15 or synthetic RNA transcripts from pTEV7DA-2del r (lane 3), pTEV7DA-AGUS (lane 4) and 

pTEV7DA-AHC (lane 5). Each lane was loaded with an equivalent amount of extract. The 
positions of HC-Pro and truncated HC-Pro derivatives are shown at the left of each panel. 
Immunoblot analysis using anti-HC-Pro serum indicated that the two reconstructed deletion 
variants encoded truncated HC-Pro-related proteins, whereas the reconstructed wild-type virus 

20 expressed a normal size product (Fig. 8A and 8B, lane 3). 

Although the N-terminal HC-Pro sequences clearly were not essential for virus 
viability, it was noticed that the levels of HC-Pro appeared to be considerably less in plants 
infected by the mutants compared to wild-type TEV-HAT. Also, the level of GUS-HC-Pro 
fusion protein in TEV-GUS-infected plants appeared less than HC-Pro in plants infected by 

25 TEV-HAT. To determine more precisely the levels of replication of wild-type and mutant 
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10 



15 



20 



25 



viruses in systemically infected tissue, the relative amounts of capsid protein and virus RNA 
were measured using immunoblot and Northern blot analyses, respectively. 

Figure 9 depicts quantitation of TEV capsid protein (panels A and B) and 
RNA (panels C and D) present in systemically infected leaves of plants inoculated with wild- 
type and mutant viruses. Panels 9(A) and 9(B) show capsid protein levels in total protein 
extracts were determined by reflective densitometry of immunoblots using capsid protein 
standards from purified virions. Each bar represents the mean (and standard deviation) of 
three independent samples. Panels 9(C) and 9(D) depict relative viral RNA levels in total 
RNA extracts determined by measuring radioactivity from a 32 P-labeled probe bound to 
Northern blots. Two independent samples are shown in each case. For comparative 
purposes, note that data shown in panels A and C were collected from contemporaneous 
samples in one experiment, while data in panels B and D were from another set of 
contemporaneous samples. The insertion of GUS into the viral genome in TEV-GUS 
decreased the yield of capsid protein and RNA to levels of 15 and 9%, respectively, 
compared to wild-type (Fig. 9A and C). The presence of GUS, therefore, had a debilitating 
effect on accumulation of virus. TEV-7del, which specified three amino acid residues from 
GUS but possessed an HC-Pro deletion to residue 89, yielded even less viral protein and 
RNA at 8 and 6% of the wild-type levels, respectively (Fig. 9A and C). In contrast, plants 
infected by TEV-2del, which coded for 45 residues of GUS but lacked 66 HC-Pro residues, 
accumulated capsid protein and RNA at 25 and 17% of the wild-type levels, respectively 
(Fig. 9A and C). TEV-2deI, therefore, replicated to a degree approximately three times 
higher than TEV-7del. 

The differences between the two spontaneous mutants may have been due to 
the presence of the short GUS sequence having a stabilizing effect in TEV-2del r or to the 
larger deletion within TEV-7del affecting a critical function of HC-Pro. To distinguish 
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between these two possibilities, two site-directed mutations were introduced into pTEV7DA- 
2del f . In the first, pTEV7DA-AGUS, the GUS sequence between nucleotides 10-135 was 
deleted, leaving a short GUS region equivalent to that present in TEV-7del. In the second, 
pTEV7DA-AHC, the HC-Pro deletion was extended from nucleotide 207 to 265. 
5 corresponding to the sequence missing in TEV-7del. Immunoblot analysis with HC-Pro- 

specific antibodies revealed the truncated HC-Pro-related products of the expected sizes in 
plants inoculated with transcripts from the two mutants (Fig. 8B, lanes 4 and 5). Based on 
the apparent quantities of HC-Pro deletion products in these plants, TEV-AGUS replicated 
better than TEV-2del r and TEV-AHC. This was confirmed by quantitation of the levels of 

10 capsid protein and RNA in systemic leaves. Compared to TEV-2del r TEV-AGUS directed 

an average of approximately 70% more capsid protein and RNA, while TEV-AHC directed 
less of these products (Fig. 9B and D). This demonstrated that the presence of the GUS 
fragment, and the absence of an additional HC-Pro sequence, both conferred a debilitating 
effect on virus accumulation. 

1 5 The ability of aphids to transmit TEV-HAT, TEV-GUS ? TEV-2del and TEV- 

7del was tested. Transmission assays demonstrated that only TEV-HAT possessed an aphid- 
transmissible phenotype (Table 3). 
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TABLE 3. Aphid transmission of wild-tvpe and mutant TEV variants 


Virus 


Without HC-Pro prefeeding 


With HC-Pro prefeeding 3 


TEV-HAT 


34/40 b 


10/10 


TEV-GUS 


0/40 


0/20 


TEV-2del 


0/30 


4/30 


TEV-7del 


0/30 


0/20 


a 


Aphids were allowed access to active HC-Pro from potato virus Y prior to 
transmission assays. 


b 


Data are shown for infected plants/total plants assayed. 



10 



15 



The lack of transmission of TEV-GUS, TEV-2del and TEV-7del could have 
been due to functional defects in the mutant HC-Pro proteins, and/or to the decrease in 
mutant virus titers and corresponding low concentrations of HC-Pro. Prefeeding of aphids 
on concentrated preparations of HC-Pro can facilitate transmission of mutant viruses 
expressing defective HC-Pro proteins. Using highly active preparations of potato virus Y 
HC-Pro, aphid transmission of TEV-2del was restored, although with low efficiency (Table 
3). The inability to transmit TEV-GUS or TEV-7del with the aid of prefeeding was due 
most likely to low levels of virus accumulation (Fig. 9A, C). 



20 



Example 3 

This is a prophetic example. The gene encoding the sequence for human 
insulin is inserted into a TEV vector as described for GUS in Example 1. The expression 
vector is applied manually to plants as in Example 1 to produce human insulin. Human 
insulin is extracted from the plants. 
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Many other variations and modifications may be made in the methods herein 
described, by those having experience in this art. without departing from the concept of the 
present invention. Accordingly, it should be clearly understood that the methods described 
in the foregoing description are illustrative only, and not intended as a limitation on the scope 
5 of the invention. 
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CLAIMS 

WE CLAIM: 

1. A viral expression vector encoding for at least one protein non-native to the 
vector that is released from at least one polyprotein expressed by said vector by proteolytic 
5 processing catalyzed by at least one protease in said polyprotein, said vector comprising: 

(a) at least one promoter; 

(b) cDNA, wherein said cDNA comprises a cDNA sequence which codes for 
at least one polyprotein from a polyprotein-producing virus; 

(c) at least one unique restriction site flanking a 3' terminus of said cDNA; 

10 and 

(d) a cloning vehicle. 

2. The expression vector according to claim 1 wherein said cDNA is from a plant 

virus. 

3 The expression vector according to claim 2 wherein said plant virus is selected 
15 from the group consisting of potyviruses, comoviruses, and nepoviruses. 

4. The expression vector according to claim 2 wherein said plant virus is at least 
one potyvirus. 

5. The expression vector according to claim 2 wherein said plant virus is tobacco 
etch virus. 
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6. A viral expression vector encoding for at least one protein non-native to the 
vector that is released from at least one polyprotein expressed by said vector by proteolytic 
processing catalyzed by at least one protease in said polyprotein, said vector comprising: 

(a) at least one promoter; 
5 (b) cDNA, wherein said cDNA comprises a cDNA sequence which codes for 

at least one polyprotein from a polyprotein-producing virus, and wherein said 
polyprotein comprises at least one protein non-native to the vector; 

(c) at least one unique restriction site flanking a 3' terminus of said cDNA; 

and 

10 (d) a cloning vehicle. 

7. The expression vector according to claim 6 wherein said cDNA is from a plant 

virus. 

8. The expression vector according to claim 6 wherein said plant virus is selected 
from the group consisting of potyviruses, comoviruses, and nepoviruses. 

15 9. The expression vector according to claim 7 wherein said plant virus is at least 

one potyvirus. 

1 0. The expression vector according to claim 7 wherein said polyprotein-producing 
plant virus is tobacco etch virus. 
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1 1 . The expression vector according to claim 6 wherein said cDN A sequence for 
said polyprotein has a coding sequence for said protein non-native to the vector inserted 
between coding sequences for proteins_native to said polyprotein-producing virus. 

12. A method for producing an expression vector encoding for at least one selected 
5 protein non-native to the vector that is released from at least one polyprotein expressed bv 

said vector by proteolytic processing catalyzed by proteases in said polyprotein, said method 
comprising: 

(a) reverse transcribing a polyprotein-producing RNA into cDNA; 

(b) introducing at least one unique restriction site flanking a 3' terminus of 
10 said cDNA; and 

(c) inserting said cDNA into a cloning vehicle. 

13. The method according to claim 12 wherein said polyprotein-producing RNA 
is from a plant virus. 

14. The method according to claim 13 wherein said plant virus is selected from 
1 5 the group consisting of potyviruses, comoviruses, and nepoviruses. 

15. The method according to claim 13 wherein said plant virus is at least one 
poty virus. 

16. The method according to claim 13 wherein said polyprotein-producing plant 
virus is tobacco etch virus. 
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17. A method for producing an expression vector, adapted for expressing in a host 
cell at least one selected protein non-native to the vector that can be released from at least 
one polyprotein by proteolytic processing catalyzed by proteases in said polyprotein, said 
method comprising: 

(a) reverse transcribing a poly protein-producing RNA into a first cDNA; 

(b) introducing at least one unique restriction site flanking a 3' terminus of 
said first cDNA; and 

(c) inserting into said first cDNA a second cDNA sequence wherein said 
second cDNA sequence encodes a protein non-native to the vector; and 

(d) inserting said first and second cDNA into a cloning vehicle. 

18. The method according to claim 17 wherein said polyprotein-producing RNA 
is from a plant virus. 

19. The method according to claim 18 wherein said plant virus is selected from 
the group consisting of potyviruses, comoviruses, and nepoviruses. 

15 20. The method according to claim 18 wherein said plant virus is at least one 

potyvirus. 

2 1 . The method according to claim 1 8 wherein said polyprotein-producing plant 
virus is tobacco etch virus. 

22. A method for expressing at least one protein in a host cell, wherein the protein 
20 is non-native to the host cell, said method comprising infecting a host cell susceptible to a 
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polyprotein-producing virus with an expression vector, wherein said vector comprises a 
cDNA sequence encoding at least one polyprotein, each of said polyprotein comprising at 
least one protein non-native to the vector and at least one protease, and wherein said protein 
non-native to the vector is released from said polyprotein by proteolytic processing catalyzed 
by said protease encoded by said cDNA, and expressing said protein non-native to the vector 
in said host cell. 

23. The method according to claim 22 wherein said protein non-native to the 
vector is a pharmaceutical. 

24. The method according to claim 22 wherein said protein non-native to the 
vector is selected from the group consisting of insulin, hGH, interleukin, EPO, G-CSF, GM- 
CSF, hPG-CSF, M-CSF, Factor VIII, Factor IX, and tPA. 

25. The method according to claim 22 wherein said protein non-native to the 
vector metabolically interacts with a compound that occurs in the host cell to produce a 
secondary metabolite of said compound. 

26. The method according to claim 22 wherein said host cell is a plant cell and 
wherein said polyprotein is from a plant virus. 

27. The method according to claim 26 wherein said plant virus is selected from 
the group consisting of potyviruses, comoviruses, and nepoviruses. 
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28. The method according to claim 26 wherein said plant virus is at least one 
potyvirus. 

29. The method according to claim 26 wherein said plant virus is tobacco etch 

virus. 
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